Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 1296675 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 217.6 MiB |
| Average record size in memory | 176.0 B |
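The two memory figures above agree with each other: dividing the total size by the number of observations gives the average record size. A minimal check, assuming the report's MiB is the binary mebibyte (2^20 bytes):

```python
# Figures copied from the "Dataset statistics" table.
total_mib = 217.6        # total size in memory, in MiB
n_rows = 1_296_675       # number of observations

# Assumption: 1 MiB = 2**20 bytes.
bytes_per_row = total_mib * 2**20 / n_rows
print(f"{bytes_per_row:.1f} B per record")
```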
Variable types
| Categorical | 13 |
|---|---|
| Numeric | 9 |
Alerts
| Alert | Type |
|---|---|
| trans_date_trans_time has a high cardinality: 1274791 distinct values | High cardinality |
| merchant has a high cardinality: 693 distinct values | High cardinality |
| first has a high cardinality: 352 distinct values | High cardinality |
| last has a high cardinality: 481 distinct values | High cardinality |
| street has a high cardinality: 983 distinct values | High cardinality |
| city has a high cardinality: 894 distinct values | High cardinality |
| state has a high cardinality: 51 distinct values | High cardinality |
| job has a high cardinality: 494 distinct values | High cardinality |
| dob has a high cardinality: 968 distinct values | High cardinality |
| trans_num has a high cardinality: 1296675 distinct values | High cardinality |
| zip is highly correlated with long and 1 other fields | High correlation |
| lat is highly correlated with merch_lat | High correlation |
| long is highly correlated with zip and 1 other fields | High correlation |
| merch_lat is highly correlated with lat | High correlation |
| merch_long is highly correlated with zip and 1 other fields | High correlation |
| state is highly correlated with zip and 5 other fields | High correlation |
| zip is highly correlated with state and 4 other fields | High correlation |
| lat is highly correlated with state and 4 other fields | High correlation |
| long is highly correlated with state and 4 other fields | High correlation |
| city_pop is highly correlated with state | High correlation |
| merch_lat is highly correlated with state and 4 other fields | High correlation |
| merch_long is highly correlated with state and 4 other fields | High correlation |
| amt is highly skewed (γ₁ = 42.27787379) | Skewed |
| trans_date_trans_time is uniformly distributed | Uniform |
| trans_num is uniformly distributed | Uniform |
| trans_num has unique values | Unique |
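The "Skewed" alert for amt is driven by the skewness coefficient. As a rough illustration of what that number measures, the sketch below computes the moment-based (population) skewness with the standard library only. The profiler relies on pandas, whose estimator applies a bias correction, so results can differ slightly on small samples; the `skewness` helper and both sample lists are hypothetical.

```python
from statistics import fmean

def skewness(values):
    """Moment-based skewness g1 = m3 / m2**1.5 (no bias correction)."""
    mean = fmean(values)
    m2 = fmean([(x - mean) ** 2 for x in values])  # second central moment
    m3 = fmean([(x - mean) ** 3 for x in values])  # third central moment
    return m3 / m2 ** 1.5

symmetric = [1.0, 2.0, 3.0, 4.0, 5.0]
long_tail = [1.0, 2.0, 3.0, 4.0, 500.0]  # one extreme amount pulls the tail right

print(skewness(symmetric))
print(skewness(long_tail))
```

A symmetric sample gives a skewness of zero, while a single very large amount among small ones pushes the coefficient well above it, which is the pattern the amt alert points to.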
Reproduction
| Analysis started | 2023-02-01 11:25:13.936968 |
|---|---|
| Analysis finished | 2023-02-01 11:27:14.744551 |
| Duration | 2 minutes and 0.81 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
trans_date_trans_time
Categorical
| Distinct | 1274791 |
|---|---|
| Distinct (%) | 98.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
| 2020-06-02 12:47:07 | 4 |
|---|---|
| 2020-06-01 01:37:47 | 4 |
| 2019-04-22 16:02:01 | 4 |
| 2019-12-01 19:39:27 | 3 |
| 2019-01-01 16:52:19 | 3 |
| Other values (1274786) |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 1253218 |
|---|---|
| Unique (%) | 96.6% |
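The report distinguishes "Distinct" (how many different values occur) from "Unique" (how many values occur exactly once), which is why 1274791 distinct timestamps include only 1253218 unique ones. A small sketch of the two counts, using a hypothetical helper and a made-up four-row sample:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    n_distinct = len(counts)
    n_unique = sum(1 for c in counts.values() if c == 1)
    return n_distinct, n_unique

sample = [
    "2019-01-01 00:00:18",
    "2019-01-01 00:00:44",
    "2019-01-01 00:00:44",  # repeated timestamp: distinct, but not unique
    "2019-01-01 00:00:51",
]
print(distinct_and_unique(sample))
```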
Sample
| 1st row | 2019-01-01 00:00:18 |
|---|---|
| 2nd row | 2019-01-01 00:00:44 |
| 3rd row | 2019-01-01 00:00:51 |
| 4th row | 2019-01-01 00:01:16 |
| 5th row | 2019-01-01 00:03:06 |
Common Values
| Value | Count | Frequency (%) |
| 2020-06-02 12:47:07 | 4 | < 0.1% |
| 2020-06-01 01:37:47 | 4 | < 0.1% |
| 2019-04-22 16:02:01 | 4 | < 0.1% |
| 2019-12-01 19:39:27 | 3 | < 0.1% |
| 2019-01-01 16:52:19 | 3 | < 0.1% |
| 2019-12-30 15:25:56 | 3 | < 0.1% |
| 2020-05-29 18:21:24 | 3 | < 0.1% |
| 2019-12-31 21:33:30 | 3 | < 0.1% |
| 2019-11-18 23:03:49 | 3 | < 0.1% |
| 2019-07-21 14:05:37 | 3 | < 0.1% |
| Other values (1274781) | 1296642 |
Words
| Value | Count | Frequency (%) |
| 2019-12-08 | 6428 | 0.2% |
| 2019-12-15 | 6425 | 0.2% |
| 2019-12-22 | 6325 | 0.2% |
| 2019-12-29 | 6320 | 0.2% |
| 2019-12-01 | 6283 | 0.2% |
| 2019-12-09 | 6252 | 0.2% |
| 2019-12-02 | 6150 | 0.2% |
| 2019-12-16 | 6127 | 0.2% |
| 2019-12-30 | 6064 | 0.2% |
| 2019-12-23 | 5937 | 0.2% |
| Other values (86927) | 2531039 |
cc_num
Real number (ℝ≥0)
| Distinct | 983 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.171920421 × 10¹⁷ |
| Minimum | 6.041620718 × 10¹⁰ |
| Maximum | 4.992346398 × 10¹⁸ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.9 MiB |
Quantile statistics
| Minimum | 6.041620718 × 10¹⁰ |
|---|---|
| 5-th percentile | 6.304848798 × 10¹¹ |
| Q1 | 1.800429465 × 10¹⁴ |
| median | 3.521417321 × 10¹⁵ |
| Q3 | 4.642255475 × 10¹⁵ |
| 95-th percentile | 4.497913966 × 10¹⁸ |
| Maximum | 4.992346398 × 10¹⁸ |
| Range | 4.992346338 × 10¹⁸ |
| Interquartile range (IQR) | 4.462212529 × 10¹⁵ |
Descriptive statistics
| Standard deviation | 1.308806447 × 10¹⁸ |
|---|---|
| Coefficient of variation (CV) | 3.1371798 |
| Kurtosis | 6.179949935 |
| Mean | 4.171920421 × 10¹⁷ |
| Median Absolute Deviation (MAD) | 3.076470873 × 10¹⁵ |
| Skewness | 2.851879006 |
| Sum | -6.725541877 × 10¹⁸ |
| Variance | 1.712974316 × 10³⁶ |
| Monotonicity | Not monotonic |
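The negative Sum reported for cc_num is not a property of the data: every value is positive, and mean times count is roughly 5.41 × 10²³, far beyond the signed 64-bit range (about 9.22 × 10¹⁸). One plausible explanation, stated here as an assumption rather than a confirmed cause, is that the column was summed in 64-bit integer arithmetic and wrapped around. The sketch below emulates that wrap-around in pure Python; `wrap_int64` and the sample card numbers are hypothetical.

```python
INT64_MAX = 2**63 - 1

def wrap_int64(x):
    """Reduce an arbitrary Python int to what a signed 64-bit integer would hold."""
    return (x + 2**63) % 2**64 - 2**63

# Made-up values of the same magnitude as the reported maximum.
card_numbers = [4_992_346_398_065_154_184] * 3

exact = sum(card_numbers)          # Python ints never overflow
wrapped = 0
for n in card_numbers:
    wrapped = wrap_int64(wrapped + n)  # emulate fixed-width accumulation

print(exact)
print(wrapped)
```

Since cc_num is an identifier rather than a quantity, its sum, mean and variance carry no meaning in any case; profiling it as a string column would avoid the issue.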
| Value | Count | Frequency (%) |
| 5.713652351 × 10¹¹ | 3123 | 0.2% |
| 4.512828415 × 10¹⁸ | 3123 | 0.2% |
| 3.672269902 × 10¹³ | 3119 | 0.2% |
| 2.131124026 × 10¹⁴ | 3117 | 0.2% |
| 3.54510934 × 10¹⁵ | 3113 | 0.2% |
| 6.534628261 × 10¹⁵ | 3112 | 0.2% |
| 6.011367958 × 10¹⁵ | 3110 | 0.2% |
| 2.720433096 × 10¹⁵ | 3107 | 0.2% |
| 6.011438889 × 10¹⁵ | 3106 | 0.2% |
| 6.011109737 × 10¹⁵ | 3101 | 0.2% |
| Other values (973) | 1265544 |
| Value | Count | Frequency (%) |
| 6.041620718 × 10¹⁰ | 1518 | |
| 6.042292873 × 10¹⁰ | 1531 | |
| 6.042309813 × 10¹⁰ | 510 | < 0.1% |
| 6.042785159 × 10¹⁰ | 528 | < 0.1% |
| 6.048700208 × 10¹⁰ | 496 | < 0.1% |
| 6.049059630 × 10¹⁰ | 1010 | |
| 6.049559311 × 10¹⁰ | 518 | < 0.1% |
| 5.018029536 × 10¹¹ | 1559 | |
| 5.018181333 × 10¹¹ | 8 | < 0.1% |
| 5.018282048 × 10¹¹ | 515 | < 0.1% |
| Value | Count | Frequency (%) |
| 4.992346398 × 10¹⁸ | 2059 | |
| 4.989847571 × 10¹⁸ | 1007 | 0.1% |
| 4.980323468 × 10¹⁸ | 532 | < 0.1% |
| 4.973530368 × 10¹⁸ | 1040 | |
| 4.958589672 × 10¹⁸ | 1476 | |
| 4.95682899 × 10¹⁸ | 2566 | |
| 4.911818931 × 10¹⁸ | 9 | < 0.1% |
| 4.906628656 × 10¹⁸ | 2584 | |
| 4.897067971 × 10¹⁸ | 1038 | |
| 4.890424427 × 10¹⁸ | 1496 |
merchant
Categorical
| Distinct | 693 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
| fraud_Kilback LLC | 4403 |
|---|---|
| fraud_Cormier LLC | 3649 |
| fraud_Schumm PLC | 3634 |
| fraud_Kuhn LLC | 3510 |
| fraud_Boyer PLC | 3493 |
| Other values (688) |
Length
| Max length | 43 |
|---|---|
| Median length | 20 |
| Mean length | 23.13259683 |
| Min length | 13 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | fraud_Rippin, Kub and Mann |
|---|---|
| 2nd row | fraud_Heller, Gutmann and Zieme |
| 3rd row | fraud_Lind-Buckridge |
| 4th row | fraud_Kutch, Hermiston and Farrell |
| 5th row | fraud_Keeling-Crist |
Common Values
| Value | Count | Frequency (%) |
| fraud_Kilback LLC | 4403 | 0.3% |
| fraud_Cormier LLC | 3649 | 0.3% |
| fraud_Schumm PLC | 3634 | 0.3% |
| fraud_Kuhn LLC | 3510 | 0.3% |
| fraud_Boyer PLC | 3493 | 0.3% |
| fraud_Dickinson Ltd | 3434 | 0.3% |
| fraud_Cummerata-Jones | 2736 | 0.2% |
| fraud_Kutch LLC | 2734 | 0.2% |
| fraud_Olson, Becker and Koch | 2723 | 0.2% |
| fraud_Stroman, Hudson and Erdman | 2721 | 0.2% |
| Other values (683) | 1263638 |
Words
| Value | Count | Frequency (%) |
| and | 474111 | 15.7% |
| llc | 97780 | 3.2% |
| inc | 91939 | 3.0% |
| sons | 73145 | 2.4% |
| ltd | 70853 | 2.3% |
| plc | 66475 | 2.2% |
| group | 50447 | 1.7% |
| fraud_kutch | 10560 | 0.3% |
| fraud_schaefer | 9394 | 0.3% |
| fraud_streich | 9250 | 0.3% |
| Other values (804) | 2069403 |
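The table above counts individual lower-cased words rather than whole merchant names, which is why generic tokens such as "and" and "llc" dominate. The report does not show the tokenizer the profiler uses; the sketch below assumes a simple split on whitespace with surrounding punctuation stripped, and applies it to the five sample rows listed for this variable. The `word_counts` helper is hypothetical.

```python
from collections import Counter
import string

def word_counts(values):
    """Count lower-cased, whitespace-separated tokens (assumed tokenization)."""
    counts = Counter()
    for value in values:
        for token in value.lower().split():
            token = token.strip(string.punctuation)  # drop trailing commas etc.
            if token:
                counts[token] += 1
    return counts

sample_rows = [
    "fraud_Rippin, Kub and Mann",
    "fraud_Heller, Gutmann and Zieme",
    "fraud_Lind-Buckridge",
    "fraud_Kutch, Hermiston and Farrell",
    "fraud_Keeling-Crist",
]
counts = word_counts(sample_rows)
print(counts["and"], counts["fraud_kutch"])
```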
category
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
| gas_transport | 131659 |
|---|---|
| grocery_pos | 123638 |
| home | 123115 |
| shopping_pos | 116672 |
| kids_pets | 113035 |
| kids_pets | |
| Other values (9) |
Length
| Max length | 14 |
|---|---|
| Median length | 11 |
| Mean length | 10.52607862 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | misc_net |
|---|---|
| 2nd row | grocery_pos |
| 3rd row | entertainment |
| 4th row | gas_transport |
| 5th row | misc_pos |
Common Values
| Value | Count | Frequency (%) |
| gas_transport | 131659 | 10.2% |
| grocery_pos | 123638 | 9.5% |
| home | 123115 | 9.5% |
| shopping_pos | 116672 | 9.0% |
| kids_pets | 113035 | 8.7% |
| shopping_net | 97543 | 7.5% |
| entertainment | 94014 | 7.3% |
| food_dining | 91461 | 7.1% |
| personal_care | 90758 | 7.0% |
| health_fitness | 85879 | 6.6% |
| Other values (4) | 228901 | 17.7% |
amt
Real number (ℝ≥0)
| Distinct | 52928 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 70.35103546 |
| Minimum | 1 |
| Maximum | 28948.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2.44 |
| Q1 | 9.65 |
| median | 47.52 |
| Q3 | 83.14 |
| 95-th percentile | 196.31 |
| Maximum | 28948.9 |
| Range | 28947.9 |
| Interquartile range (IQR) | 73.49 |
Descriptive statistics
| Standard deviation | 160.3160386 |
|---|---|
| Coefficient of variation (CV) | 2.278801407 |
| Kurtosis | 4545.644979 |
| Mean | 70.35103546 |
| Median Absolute Deviation (MAD) | 37.5 |
| Skewness | 42.27787379 |
| Sum | 91222428.9 |
| Variance | 25701.23222 |
| Monotonicity | Not monotonic |
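Several of the amt figures above are derived from others, which makes them easy to sanity-check. The sketch below recomputes the range, IQR, coefficient of variation, variance and sum from values copied out of the tables; differences in the last digits come from rounding in the report.

```python
import math

n = 1_296_675                      # number of observations
mean, std = 70.35103546, 160.3160386
minimum, maximum = 1.0, 28948.9
q1, q3 = 9.65, 83.14

value_range = maximum - minimum    # Range
iqr = q3 - q1                      # Interquartile range
cv = std / mean                    # Coefficient of variation
variance = std ** 2                # Variance
total = mean * n                   # Sum

print(round(value_range, 1), round(iqr, 2), round(cv, 6))
print(round(variance, 2), round(total, 1))
```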
| Value | Count | Frequency (%) |
| 1.14 | 542 | < 0.1% |
| 1.04 | 538 | < 0.1% |
| 1.25 | 535 | < 0.1% |
| 1.02 | 533 | < 0.1% |
| 1.01 | 523 | < 0.1% |
| 1.05 | 519 | < 0.1% |
| 1.2 | 516 | < 0.1% |
| 1.23 | 515 | < 0.1% |
| 1.08 | 512 | < 0.1% |
| 1.11 | 509 | < 0.1% |
| Other values (52918) | 1291433 |
| Value | Count | Frequency (%) |
| 1 | 222 | |
| 1.01 | 523 | |
| 1.02 | 533 | |
| 1.03 | 499 | |
| 1.04 | 538 | |
| 1.05 | 519 | |
| 1.06 | 471 | |
| 1.07 | 498 | |
| 1.08 | 512 | |
| 1.09 | 496 |
| Value | Count | Frequency (%) |
| 28948.9 | 1 | |
| 27390.12 | 1 | |
| 27119.77 | 1 | |
| 26544.12 | 1 | |
| 25086.94 | 1 | |
| 17897.24 | 1 | |
| 15305.95 | 1 | |
| 15047.03 | 1 | |
| 15034.18 | 1 | |
| 14849.74 | 1 |
first
Categorical
| Distinct | 352 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
| Christopher | 26669 |
|---|---|
| Robert | 21667 |
| Jessica | 20581 |
| James | 20039 |
| Michael | 20009 |
| Other values (347) |
Length
| Max length | 11 |
|---|---|
| Median length | 6 |
| Mean length | 6.080431874 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Jennifer |
|---|---|
| 2nd row | Stephanie |
| 3rd row | Edward |
| 4th row | Jeremy |
| 5th row | Tyler |
Common Values
| Value | Count | Frequency (%) |
| Christopher | 26669 | 2.1% |
| Robert | 21667 | 1.7% |
| Jessica | 20581 | 1.6% |
| James | 20039 | 1.5% |
| Michael | 20009 | 1.5% |
| David | 19965 | 1.5% |
| Jennifer | 16940 | 1.3% |
| William | 16371 | 1.3% |
| Mary | 16346 | 1.3% |
| John | 16325 | 1.3% |
| Other values (342) | 1101763 |
Words
| Value | Count | Frequency (%) |
| christopher | 26669 | 2.1% |
| robert | 21667 | 1.7% |
| jessica | 20581 | 1.6% |
| james | 20039 | 1.5% |
| michael | 20009 | 1.5% |
| david | 19965 | 1.5% |
| jennifer | 16940 | 1.3% |
| william | 16371 | 1.3% |
| mary | 16346 | 1.3% |
| john | 16325 | 1.3% |
| Other values (342) | 1101763 |
last
Categorical
| Distinct | 481 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
| Smith | 28794 |
|---|---|
| Williams | 23605 |
| Davis | 21910 |
| Johnson | 20034 |
| Rodriguez | 17394 |
| Other values (476) |
Length
| Max length | 11 |
|---|---|
| Median length | 6 |
| Mean length | 6.111177435 |
| Min length | 2 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Banks |
|---|---|
| 2nd row | Gill |
| 3rd row | Sanchez |
| 4th row | White |
| 5th row | Garcia |
Common Values
| Value | Count | Frequency (%) |
| Smith | 28794 | 2.2% |
| Williams | 23605 | 1.8% |
| Davis | 21910 | 1.7% |
| Johnson | 20034 | 1.5% |
| Rodriguez | 17394 | 1.3% |
| Martinez | 14805 | 1.1% |
| Jones | 13976 | 1.1% |
| Lewis | 12753 | 1.0% |
| Gonzalez | 11799 | 0.9% |
| Miller | 11698 | 0.9% |
| Other values (471) | 1119907 |
Words
| Value | Count | Frequency (%) |
| smith | 28794 | 2.2% |
| williams | 23605 | 1.8% |
| davis | 21910 | 1.7% |
| johnson | 20034 | 1.5% |
| rodriguez | 17394 | 1.3% |
| martinez | 14805 | 1.1% |
| jones | 13976 | 1.1% |
| lewis | 12753 | 1.0% |
| gonzalez | 11799 | 0.9% |
| miller | 11698 | 0.9% |
| Other values (471) | 1119907 |
gender
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
| F | 709863 |
|---|---|
| M | 586812 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | F |
|---|---|
| 2nd row | F |
| 3rd row | M |
| 4th row | M |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| F | 709863 | 54.7% |
| M | 586812 | 45.3% |
Words
| Value | Count | Frequency (%) |
| f | 709863 | 54.7% |
| m | 586812 | 45.3% |
street
Categorical
| Distinct | 983 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
| 0069 Robin Brooks Apt. 695 | 3123 |
|---|---|
| 864 Reynolds Plains | 3123 |
| 8172 Robertson Parkways Suite 072 | 3119 |
| 4664 Sanchez Common Suite 930 | 3117 |
| 8030 Beck Motorway | 3113 |
| Other values (978) |
Length
| Max length | 35 |
|---|---|
| Median length | 22 |
| Mean length | 22.22902655 |
| Min length | 12 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 561 Perry Cove |
|---|---|
| 2nd row | 43039 Riley Greens Suite 393 |
| 3rd row | 594 White Dale Suite 530 |
| 4th row | 9443 Cynthia Court Apt. 038 |
| 5th row | 408 Bradley Rest |
Common Values
| Value | Count | Frequency (%) |
| 0069 Robin Brooks Apt. 695 | 3123 | 0.2% |
| 864 Reynolds Plains | 3123 | 0.2% |
| 8172 Robertson Parkways Suite 072 | 3119 | 0.2% |
| 4664 Sanchez Common Suite 930 | 3117 | 0.2% |
| 8030 Beck Motorway | 3113 | 0.2% |
| 29606 Martinez Views Suite 653 | 3112 | 0.2% |
| 1652 James Mews | 3110 | 0.2% |
| 854 Walker Dale Suite 488 | 3107 | 0.2% |
| 40624 Rebecca Spurs | 3106 | 0.2% |
| 594 Berry Lights Apt. 392 | 3101 | 0.2% |
| Other values (973) | 1265544 |
Words
| Value | Count | Frequency (%) |
| apt | 327791 | 6.4% |
| suite | 305467 | 5.9% |
| island | 22954 | 0.4% |
| michael | 18967 | 0.4% |
| common | 17978 | 0.3% |
| station | 17957 | 0.3% |
| islands | 17917 | 0.3% |
| david | 17476 | 0.3% |
| brooks | 16991 | 0.3% |
| fields | 16321 | 0.3% |
| Other values (1940) | 4376722 |
city
Categorical
| Distinct | 894 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
| Birmingham | 5617 |
|---|---|
| San Antonio | 5130 |
| Utica | 5105 |
| Phoenix | 5075 |
| Meridian | 5060 |
| Other values (889) |
Length
| Max length | 25 |
|---|---|
| Median length | 8 |
| Mean length | 8.652245937 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Moravian Falls |
|---|---|
| 2nd row | Orient |
| 3rd row | Malad City |
| 4th row | Boulder |
| 5th row | Doe Hill |
Common Values
| Value | Count | Frequency (%) |
| Birmingham | 5617 | 0.4% |
| San Antonio | 5130 | 0.4% |
| Utica | 5105 | 0.4% |
| Phoenix | 5075 | 0.4% |
| Meridian | 5060 | 0.4% |
| Thomas | 4634 | 0.4% |
| Conway | 4613 | 0.4% |
| Cleveland | 4604 | 0.4% |
| Warren | 4599 | 0.4% |
| Houston | 4168 | 0.3% |
| Other values (884) | 1248070 |
Words
| Value | Count | Frequency (%) |
| city | 21314 | 1.3% |
| west | 19473 | 1.2% |
| north | 14425 | 0.9% |
| saint | 14363 | 0.9% |
| falls | 12794 | 0.8% |
| new | 11842 | 0.7% |
| mount | 11375 | 0.7% |
| lake | 11249 | 0.7% |
| san | 10260 | 0.6% |
| springs | 8727 | 0.5% |
| Other values (918) | 1482445 |
state
Categorical
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
| TX | 94876 |
|---|---|
| NY | 83501 |
| PA | 79847 |
| CA | 56360 |
| OH | 46480 |
| Other values (46) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NC |
|---|---|
| 2nd row | WA |
| 3rd row | ID |
| 4th row | MT |
| 5th row | VA |
Common Values
| Value | Count | Frequency (%) |
| TX | 94876 | 7.3% |
| NY | 83501 | 6.4% |
| PA | 79847 | 6.2% |
| CA | 56360 | 4.3% |
| OH | 46480 | 3.6% |
| MI | 46154 | 3.6% |
| IL | 43252 | 3.3% |
| FL | 42671 | 3.3% |
| AL | 40989 | 3.2% |
| MO | 38403 | 3.0% |
| Other values (41) | 724142 |
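The Frequency (%) column is each count divided by the 1296675 observations. Judging from the tables in this report, the value is rounded to one decimal place and shown as "< 0.1%" only when it would otherwise round to 0.0% (a count of 1031 is displayed as 0.1%, a count of 550 as < 0.1%). That rule is inferred from the output, not taken from the profiler's source; `format_frequency` is a hypothetical helper reproducing it.

```python
TOTAL_ROWS = 1_296_675  # number of observations reported for the dataset

def format_frequency(count, total=TOTAL_ROWS):
    """Format a count as a percentage the way the report's tables appear to."""
    pct = f"{100 * count / total:.1f}"
    return "< 0.1%" if pct == "0.0" else f"{pct}%"

print(format_frequency(94876))  # TX
print(format_frequency(1031))
print(format_frequency(550))
```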
Words
| Value | Count | Frequency (%) |
| tx | 94876 | 7.3% |
| ny | 83501 | 6.4% |
| pa | 79847 | 6.2% |
| ca | 56360 | 4.3% |
| oh | 46480 | 3.6% |
| mi | 46154 | 3.6% |
| il | 43252 | 3.3% |
| fl | 42671 | 3.3% |
| al | 40989 | 3.2% |
| mo | 38403 | 3.0% |
| Other values (41) | 724142 |
zip
Real number (ℝ≥0)
| Distinct | 970 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 48800.6711 |
| Minimum | 1257 |
| Maximum | 99783 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.9 MiB |
Quantile statistics
| Minimum | 1257 |
|---|---|
| 5-th percentile | 7208 |
| Q1 | 26237 |
| median | 48174 |
| Q3 | 72042 |
| 95-th percentile | 94569 |
| Maximum | 99783 |
| Range | 98526 |
| Interquartile range (IQR) | 45805 |
Descriptive statistics
| Standard deviation | 26893.22248 |
|---|---|
| Coefficient of variation (CV) | 0.551083046 |
| Kurtosis | -1.096449332 |
| Mean | 48800.6711 |
| Median Absolute Deviation (MAD) | 23068 |
| Skewness | 0.07968075775 |
| Sum | 6.32786102 × 10¹⁰ |
| Variance | 723245415.2 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 73754 | 3646 | 0.3% |
| 34112 | 3613 | 0.3% |
| 48088 | 3597 | 0.3% |
| 82514 | 3527 | 0.3% |
| 49628 | 3123 | 0.2% |
| 15484 | 3123 | 0.2% |
| 85173 | 3119 | 0.2% |
| 29819 | 3117 | 0.2% |
| 38761 | 3113 | 0.2% |
| 5461 | 3112 | 0.2% |
| Other values (960) | 1263585 |
| Value | Count | Frequency (%) |
| 1257 | 2023 | |
| 1330 | 1031 | 0.1% |
| 1535 | 515 | < 0.1% |
| 1545 | 1024 | 0.1% |
| 1612 | 519 | < 0.1% |
| 1843 | 2597 | |
| 1844 | 2058 | |
| 2180 | 519 | < 0.1% |
| 2630 | 2090 | |
| 2908 | 550 | < 0.1% |
| Value | Count | Frequency (%) |
| 99783 | 1568 | |
| 99747 | 12 | < 0.1% |
| 99746 | 540 | < 0.1% |
| 99323 | 2572 | |
| 99160 | 3030 | |
| 99116 | 15 | < 0.1% |
| 99113 | 1047 | 0.1% |
| 99033 | 2458 | |
| 98836 | 524 | < 0.1% |
| 98665 | 500 | < 0.1% |
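The smallest zip values above have only four digits because the column was profiled as a number, which drops leading zeros. Assuming these are five-digit US ZIP codes, a sketch of restoring the padded form before treating the column as a categorical identifier (`format_zip` is a hypothetical helper):

```python
def format_zip(value):
    """Render a numeric ZIP code as a five-character string with leading zeros."""
    return f"{int(value):05d}"

for raw in (1257, 5461, 99783):
    print(raw, "->", format_zip(raw))
```

Read as a code rather than a quantity, zip has no meaningful mean, sum or skewness, so the numeric statistics in this section are best taken as descriptive of the encoding only.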
lat
Real number (ℝ≥0)
| Distinct | 968 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.53762161 |
| Minimum | 20.0271 |
| Maximum | 66.6933 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.9 MiB |
Quantile statistics
| Minimum | 20.0271 |
|---|---|
| 5-th percentile | 29.8826 |
| Q1 | 34.6205 |
| median | 39.3543 |
| Q3 | 41.9404 |
| 95-th percentile | 45.8433 |
| Maximum | 66.6933 |
| Range | 46.6662 |
| Interquartile range (IQR) | 7.3199 |
Descriptive statistics
| Standard deviation | 5.075808439 |
|---|---|
| Coefficient of variation (CV) | 0.1317104748 |
| Kurtosis | 0.8129679455 |
| Mean | 38.53762161 |
| Median Absolute Deviation (MAD) | 3.3597 |
| Skewness | -0.1860276801 |
| Sum | 49970770.51 |
| Variance | 25.76383131 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 36.385 | 3646 | 0.3% |
| 26.1184 | 3613 | 0.3% |
| 42.5164 | 3597 | 0.3% |
| 43.0048 | 3527 | 0.3% |
| 44.5995 | 3123 | 0.2% |
| 39.8936 | 3123 | 0.2% |
| 33.2887 | 3119 | 0.2% |
| 34.0326 | 3117 | 0.2% |
| 33.4783 | 3113 | 0.2% |
| 44.3346 | 3112 | 0.2% |
| Other values (958) | 1263585 |
| Value | Count | Frequency (%) |
| 20.0271 | 1527 | |
| 20.0827 | 1032 | 0.1% |
| 24.6557 | 2584 | |
| 26.1184 | 3613 | |
| 26.3304 | 542 | < 0.1% |
| 26.3771 | 518 | < 0.1% |
| 26.4215 | 3038 | |
| 26.4722 | 2524 | |
| 26.529 | 1549 | |
| 26.6939 | 1027 | 0.1% |
| Value | Count | Frequency (%) |
| 66.6933 | 12 | < 0.1% |
| 65.6899 | 540 | < 0.1% |
| 64.7556 | 1568 | |
| 48.8878 | 3030 | |
| 48.8856 | 2066 | |
| 48.8328 | 1533 | |
| 48.6669 | 1047 | 0.1% |
| 48.6031 | 2973 | |
| 48.4786 | 2038 | |
| 48.34 | 3088 |
long
Real number (ℝ)
| Distinct | 969 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -90.22633538 |
| Minimum | -165.6723 |
| Maximum | -67.9503 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 1296675 |
| Negative (%) | 100.0% |
| Memory size | 9.9 MiB |
Quantile statistics
| Minimum | -165.6723 |
|---|---|
| 5-th percentile | -119.0825 |
| Q1 | -96.798 |
| median | -87.4769 |
| Q3 | -80.158 |
| 95-th percentile | -73.5112 |
| Maximum | -67.9503 |
| Range | 97.722 |
| Interquartile range (IQR) | 16.64 |
Descriptive statistics
| Standard deviation | 13.75907695 |
|---|---|
| Coefficient of variation (CV) | -0.1524951323 |
| Kurtosis | 1.855892285 |
| Mean | -90.22633538 |
| Median Absolute Deviation (MAD) | 8.1527 |
| Skewness | -1.150107737 |
| Sum | -116994233.4 |
| Variance | 189.3121984 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -98.0727 | 3646 | 0.3% |
| -81.7361 | 3613 | 0.3% |
| -82.9832 | 3597 | 0.3% |
| -108.8964 | 3527 | 0.3% |
| -79.7856 | 3123 | 0.2% |
| -86.2141 | 3123 | 0.2% |
| -111.0985 | 3119 | 0.2% |
| -82.2027 | 3117 | 0.2% |
| -90.5142 | 3113 | 0.2% |
| -73.098 | 3112 | 0.2% |
| Other values (959) | 1263585 |
| Value | Count | Frequency (%) |
| -165.6723 | 1568 | |
| -156.292 | 540 | < 0.1% |
| -155.488 | 1032 | |
| -155.3697 | 1527 | |
| -153.994 | 12 | < 0.1% |
| -124.4409 | 1043 | |
| -124.2174 | 1547 | |
| -124.1587 | 1031 | |
| -124.1437 | 1526 | |
| -123.9743 | 2036 |
| Value | Count | Frequency (%) |
| -67.9503 | 2080 | |
| -68.5565 | 1014 | 0.1% |
| -69.2675 | 519 | < 0.1% |
| -69.4828 | 2050 | |
| -69.9576 | 537 | < 0.1% |
| -69.9656 | 3107 | |
| -70.1031 | 9 | < 0.1% |
| -70.239 | 1036 | 0.1% |
| -70.3001 | 2090 | |
| -70.3457 | 1527 |
city_pop
Real number (ℝ≥0)
| Distinct | 879 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 88824.44056 |
| Minimum | 23 |
| Maximum | 2906700 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.9 MiB |
Quantile statistics
| Minimum | 23 |
|---|---|
| 5-th percentile | 139 |
| Q1 | 743 |
| median | 2456 |
| Q3 | 20328 |
| 95-th percentile | 525713 |
| Maximum | 2906700 |
| Range | 2906677 |
| Interquartile range (IQR) | 19585 |
Descriptive statistics
| Standard deviation | 301956.3607 |
|---|---|
| Coefficient of variation (CV) | 3.399473825 |
| Kurtosis | 37.6145193 |
| Mean | 88824.44056 |
| Median Absolute Deviation (MAD) | 2198 |
| Skewness | 5.593853067 |
| Sum | 1.151764315 × 10¹¹ |
| Variance | 9.117764376 × 10¹⁰ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 606 | 5496 | 0.4% |
| 1595797 | 5130 | 0.4% |
| 1312922 | 5075 | 0.4% |
| 1766 | 4574 | 0.4% |
| 241 | 4533 | 0.3% |
| 2906700 | 4168 | 0.3% |
| 276002 | 4155 | 0.3% |
| 302 | 4147 | 0.3% |
| 910148 | 4073 | 0.3% |
| 198 | 4067 | 0.3% |
| Other values (869) | 1251257 |
| Value | Count | Frequency (%) |
| 23 | 2049 | |
| 37 | 1013 | 0.1% |
| 43 | 2034 | |
| 46 | 3040 | |
| 47 | 511 | < 0.1% |
| 49 | 1054 | 0.1% |
| 51 | 1016 | 0.1% |
| 52 | 518 | < 0.1% |
| 53 | 2610 | |
| 60 | 1045 | 0.1% |
| Value | Count | Frequency (%) |
| 2906700 | 4168 | |
| 2504700 | 2033 | 0.2% |
| 2383912 | 521 | < 0.1% |
| 1595797 | 5130 | |
| 1577385 | 2563 | |
| 1526206 | 3517 | |
| 1417793 | 8 | < 0.1% |
| 1382480 | 2056 | |
| 1312922 | 5075 | |
| 1263321 | 3629 |
job
Categorical
| Distinct | 494 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
| Film/video editor | 9779 |
|---|---|
| Exhibition designer | 9199 |
| Naval architect | 8684 |
| Surveyor, land/geomatics | 8680 |
| Materials engineer | 8270 |
| Other values (489) |
Length
| Max length | 59 |
|---|---|
| Median length | 19 |
| Mean length | 20.2271024 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Psychologist, counselling |
|---|---|
| 2nd row | Special educational needs teacher |
| 3rd row | Nature conservation officer |
| 4th row | Patent attorney |
| 5th row | Dance movement psychotherapist |
Common Values
| Value | Count | Frequency (%) |
| Film/video editor | 9779 | 0.8% |
| Exhibition designer | 9199 | 0.7% |
| Naval architect | 8684 | 0.7% |
| Surveyor, land/geomatics | 8680 | 0.7% |
| Materials engineer | 8270 | 0.6% |
| Designer, ceramics/pottery | 8225 | 0.6% |
| Systems developer | 7700 | 0.6% |
| IT trainer | 7679 | 0.6% |
| Financial adviser | 7659 | 0.6% |
| Environmental consultant | 7547 | 0.6% |
| Other values (484) | 1213253 |
Words
| Value | Count | Frequency (%) |
| engineer | 131756 | 4.6% |
| officer | 110915 | 3.9% |
| manager | 61124 | 2.1% |
| scientist | 55878 | 1.9% |
| designer | 52218 | 1.8% |
| surveyor | 49062 | 1.7% |
| teacher | 38126 | 1.3% |
| psychologist | 32600 | 1.1% |
| research | 29754 | 1.0% |
| editor | 28725 | 1.0% |
| Other values (456) | 2289024 |
dob
Categorical
| Distinct | 968 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
| 1977-03-23 | 5636 |
|---|---|
| 1981-08-29 | 4636 |
| 1988-09-15 | 4623 |
| 1955-05-06 | 3661 |
| 1995-07-12 | 3123 |
| Other values (963) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1988-03-09 |
|---|---|
| 2nd row | 1978-06-21 |
| 3rd row | 1962-01-19 |
| 4th row | 1967-01-12 |
| 5th row | 1986-03-28 |
Common Values
| Value | Count | Frequency (%) |
| 1977-03-23 | 5636 | 0.4% |
| 1981-08-29 | 4636 | 0.4% |
| 1988-09-15 | 4623 | 0.4% |
| 1955-05-06 | 3661 | 0.3% |
| 1995-07-12 | 3123 | 0.2% |
| 1983-07-25 | 3123 | 0.2% |
| 1987-10-28 | 3119 | 0.2% |
| 1984-06-03 | 3117 | 0.2% |
| 1999-03-05 | 3113 | 0.2% |
| 1998-03-19 | 3112 | 0.2% |
| Other values (958) | 1259412 |
trans_num
Categorical
| Distinct | 1296675 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
| 32e4534ec328b0dc06e915376ac45f66 | 1 |
|---|---|
| 0c0598ad26b0a46ef55af16eb1f644ad | 1 |
| d14057dd4916c3020246fc38ef88bc1e | 1 |
| 00b0e841d9d663c50800a3d8a58d89bd | 1 |
| 162f4d53cd0f7ec22f8aca076ffafd7f | 1 |
| Other values (1296670) |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 1296675 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 0b242abb623afc578575680df30655b9 |
|---|---|
| 2nd row | 1f76529f8574734946361c461b024d99 |
| 3rd row | a1a22d70485983eac12b5b88dad1cf95 |
| 4th row | 6b849c168bdad6f867558c3793159a81 |
| 5th row | a41d7549acf90789359a9aa5346dcb46 |
Common Values
| Value | Count | Frequency (%) |
| 32e4534ec328b0dc06e915376ac45f66 | 1 | < 0.1% |
| 0c0598ad26b0a46ef55af16eb1f644ad | 1 | < 0.1% |
| d14057dd4916c3020246fc38ef88bc1e | 1 | < 0.1% |
| 00b0e841d9d663c50800a3d8a58d89bd | 1 | < 0.1% |
| 162f4d53cd0f7ec22f8aca076ffafd7f | 1 | < 0.1% |
| f8782fa5f2053f9a77fd8ecfbe7fa9cf | 1 | < 0.1% |
| 98af7a381e33ad6802cdb2f1c92e3675 | 1 | < 0.1% |
| 0e2fef0cf6ff9150cb15508adc002f26 | 1 | < 0.1% |
| 98b518878be5dd03e12495f5b701ee11 | 1 | < 0.1% |
| b4a11538e13cb33e5c2f4dcf6c10cf7c | 1 | < 0.1% |
| Other values (1296665) | 1296665 |
Words
| Value | Count | Frequency (%) |
| f7eff44f07d5da2fcaffb88a9ecb97d3 | 1 | < 0.1% |
| 58655490d3993406721066dcd484eaf8 | 1 | < 0.1% |
| e5fc727fed2d17134bd12ddac6051d55 | 1 | < 0.1% |
| 4f7842851f43cb611813026ea3af731f | 1 | < 0.1% |
| b9707bb1ccd11bfd1a30d1e6d67a9c03 | 1 | < 0.1% |
| 5787bd42842877f928d2e57a33e4512a | 1 | < 0.1% |
| c7dc28f67636b7421cf50662346bc7ca | 1 | < 0.1% |
| 3dde4f84a3cc21ce78f425b8bcd9ff34 | 1 | < 0.1% |
| 1778f7152cada79befdfb42b8b211287 | 1 | < 0.1% |
| 22efdcfff6586eae277a75a12b204c0a | 1 | < 0.1% |
| Other values (1296665) | 1296665 |
unix_time
Real number (ℝ≥0)
| Distinct | 1274823 |
|---|---|
| Distinct (%) | 98.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1349243637 |
| Minimum | 1325376018 |
|---|---|
| Maximum | 1371816817 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.9 MiB |
Quantile statistics
| Minimum | 1325376018 |
|---|---|
| 5-th percentile | 1328671975 |
| Q1 | 1338750742 |
| median | 1349249747 |
| Q3 | 1359385376 |
| 95-th percentile | 1369830595 |
| Maximum | 1371816817 |
| Range | 46440799 |
| Interquartile range (IQR) | 20634633 |
Descriptive statistics
| Standard deviation | 12841278.42 |
|---|---|
| Coefficient of variation (CV) | 0.009517390391 |
| Kurtosis | -1.087540501 |
| Mean | 1349243637 |
| Median Absolute Deviation (MAD) | 10358807 |
| Skewness | 0.003377949757 |
| Sum | 1.749530493 × 10¹⁵ |
| Variance | 1.648984315 × 10¹⁴ |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 1335110521 | 4 | < 0.1% |
| 1370177227 | 4 | < 0.1% |
| 1370050667 | 4 | < 0.1% |
| 1356144337 | 3 | < 0.1% |
| 1334432745 | 3 | < 0.1% |
| 1354630600 | 3 | < 0.1% |
| 1349404036 | 3 | < 0.1% |
| 1338044031 | 3 | < 0.1% |
| 1334966271 | 3 | < 0.1% |
| 1345129731 | 3 | < 0.1% |
| Other values (1274813) | 1296642 |
| Value | Count | Frequency (%) |
| 1325376018 | 1 | |
| 1325376044 | 1 | |
| 1325376051 | 1 | |
| 1325376076 | 1 | |
| 1325376186 | 1 | |
| 1325376248 | 1 | |
| 1325376282 | 1 | |
| 1325376308 | 1 | |
| 1325376318 | 1 | |
| 1325376361 | 1 |
| Value | Count | Frequency (%) |
| 1371816817 | 1 | |
| 1371816816 | 1 | |
| 1371816752 | 1 | |
| 1371816739 | 1 | |
| 1371816728 | 1 | |
| 1371816696 | 1 | |
| 1371816683 | 1 | |
| 1371816656 | 1 | |
| 1371816562 | 1 | |
| 1371816522 | 1 |
merch_lat
Real number (ℝ≥0)
| Distinct | 1247805 |
|---|---|
| Distinct (%) | 96.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.53733804 |
| Minimum | 19.027785 |
|---|---|
| Maximum | 67.510267 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.9 MiB |
Quantile statistics
| Minimum | 19.027785 |
|---|---|
| 5-th percentile | 29.7516534 |
| Q1 | 34.733572 |
| median | 39.36568 |
| Q3 | 41.957164 |
| 95-th percentile | 46.0035301 |
| Maximum | 67.510267 |
| Range | 48.482482 |
| Interquartile range (IQR) | 7.223592 |
Descriptive statistics
| Standard deviation | 5.10978837 |
|---|---|
| Coefficient of variation (CV) | 0.1325931844 |
| Kurtosis | 0.79599391 |
| Mean | 38.53733804 |
| Median Absolute Deviation (MAD) | 3.397536 |
| Skewness | -0.1819154297 |
| Sum | 49970402.81 |
| Variance | 26.10993718 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 34.048439 | 4 | < 0.1% |
| 42.749184 | 4 | < 0.1% |
| 41.568128 | 4 | < 0.1% |
| 40.772096 | 4 | < 0.1% |
| 41.632488 | 4 | < 0.1% |
| 41.910192 | 4 | < 0.1% |
| 39.845849 | 4 | < 0.1% |
| 40.550199 | 4 | < 0.1% |
| 37.669788 | 4 | < 0.1% |
| 40.277086 | 4 | < 0.1% |
| Other values (1247795) | 1296635 |
| Value | Count | Frequency (%) |
| 19.027785 | 1 | |
| 19.027804 | 1 | |
| 19.029798 | 1 | |
| 19.031242 | 1 | |
| 19.032277 | 1 | |
| 19.033288 | 1 | |
| 19.034282 | 1 | |
| 19.034687 | 1 | |
| 19.035472 | 1 | |
| 19.036312 | 1 |
| Value | Count | Frequency (%) |
| 67.510267 | 1 | |
| 67.441518 | 1 | |
| 67.397018 | 1 | |
| 67.188111 | 1 | |
| 67.064277 | 1 | |
| 66.835174 | 1 | |
| 66.682905 | 1 | |
| 66.67355 | 1 | |
| 66.664673 | 1 | |
| 66.659242 | 1 |
merch_long
Real number (ℝ)
| Distinct | 1275745 |
|---|---|
| Distinct (%) | 98.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -90.2264648 |
| Minimum | -166.671242 |
|---|---|
| Maximum | -66.950902 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 1296675 |
| Negative (%) | 100.0% |
| Memory size | 9.9 MiB |
Quantile statistics
| Minimum | -166.671242 |
|---|---|
| 5-th percentile | -119.3300916 |
| Q1 | -96.8972755 |
| median | -87.438392 |
| Q3 | -80.2367965 |
| 95-th percentile | -73.3542179 |
| Maximum | -66.950902 |
| Range | 99.72034 |
| Interquartile range (IQR) | 16.660479 |
Descriptive statistics
| Standard deviation | 13.77109056 |
|---|---|
| Coefficient of variation (CV) | -0.1526280631 |
| Kurtosis | 1.848479176 |
| Mean | -90.2264648 |
| Median Absolute Deviation (MAD) | 8.227889 |
| Skewness | -1.146959945 |
| Sum | -116994401.2 |
| Variance | 189.6429353 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -87.116414 | 4 | < 0.1% |
| -74.618269 | 4 | < 0.1% |
| -81.219189 | 4 | < 0.1% |
| -86.763294 | 3 | < 0.1% |
| -80.575864 | 3 | < 0.1% |
| -81.210015 | 3 | < 0.1% |
| -73.634879 | 3 | < 0.1% |
| -82.283919 | 3 | < 0.1% |
| -81.458097 | 3 | < 0.1% |
| -80.900899 | 3 | < 0.1% |
| Other values (1275735) | 1296642 |
| Value | Count | Frequency (%) |
| -166.671242 | 1 | |
| -166.670132 | 1 | |
| -166.669638 | 1 | |
| -166.666179 | 1 | |
| -166.664828 | 1 | |
| -166.662888 | 1 | |
| -166.661968 | 1 | |
| -166.659277 | 1 | |
| -166.657834 | 1 | |
| -166.657174 | 1 |
| Value | Count | Frequency (%) |
| -66.950902 | 1 | |
| -66.955996 | 1 | |
| -66.95654 | 1 | |
| -66.958659 | 1 | |
| -66.958751 | 1 | |
| -66.959178 | 1 | |
| -66.961923 | 1 | |
| -66.962913 | 1 | |
| -66.963918 | 1 | |
| -66.963975 | 1 |
is_fraud
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
| 0 | 1289169 |
|---|---|
| 1 | 7506 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1289169 | 99.4% |
| 1 | 7506 | 0.6% |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures monotonic correlation between two variables, and is therefore better than Pearson's r at capturing nonlinear monotonic relationships. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
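The calculation described above can be sketched in plain Python. This is only an illustration and not the report generator's own implementation: the helper names `average_ranks`, `pearson_r`, and `spearman_rho` are hypothetical, ties are assumed to receive the average of their rank positions, and the sample values are the `lat` and `merch_lat` entries of the first five rows of this dataset.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Covariance divided by the product of standard deviations (1/n cancels)."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def spearman_rho(x, y):
    """Spearman's rho: Pearson's r applied to the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))


# lat and merch_lat from the first five rows of the dataset.
lat = [36.0788, 48.8878, 42.1808, 46.2306, 38.4207]
merch_lat = [36.011293, 49.159047, 43.150704, 47.034331, 38.674999]
rho = spearman_rho(lat, merch_lat)
print(rho)
```

In these five rows the two columns are ordered identically, so ρ comes out at (numerically) 1. Five rows are far too few to say anything about the full dataset; the example only shows the mechanics.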
Pearson's r
Pearson's correlation coefficient (r) measures linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the steepness of the line (its angle to the x-axis) does not affect the magnitude of r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
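A minimal sketch of this formula, together with a check of the location-and-scale invariance, might look as follows. The function name `pearson_r` and the particular shift and scale constants are illustrative assumptions; the data are the `lat` and `merch_lat` values from the first five rows of this dataset.

```python
from math import sqrt


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations.

    The 1/n factors in the covariance and the standard deviations cancel,
    so plain sums of deviations are used.
    """
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


# lat and merch_lat from the first five rows of the dataset.
lat = [36.0788, 48.8878, 42.1808, 46.2306, 38.4207]
merch_lat = [36.011293, 49.159047, 43.150704, 47.034331, 38.674999]

r = pearson_r(lat, merch_lat)

# Shift and rescale each variable separately (positive scale factors);
# r should be unchanged up to floating-point error.
r_transformed = pearson_r(
    [2.0 * v + 10.0 for v in lat],
    [0.5 * v - 3.0 for v in merch_lat],
)
print(r, r_transformed)
```

Both printed values agree up to rounding. Note that the invariance holds for positive scale factors; multiplying one variable by a negative constant flips the sign of r.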
Kendall's τ
Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one counts the concordant and discordant pairs of observations: τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
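The pair-counting rule as stated above can be sketched directly. This follows the formula literally (the variant usually called τ-a, with no correction for ties), which may differ from what a given library computes; the function name `kendall_tau_a` is hypothetical, and the data are again `lat` and `merch_lat` from the first five rows.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total pairs, with no tie correction."""
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
        # direction == 0 means a tie in x or y; it counts toward neither.
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# lat and merch_lat from the first five rows of the dataset.
lat = [36.0788, 48.8878, 42.1808, 46.2306, 38.4207]
merch_lat = [36.011293, 49.159047, 43.150704, 47.034331, 38.674999]
tau = kendall_tau_a(lat, merch_lat)
print(tau)
```

The pairwise loop is quadratic in the number of observations, so it suits small samples only; on a dataset of this size one would sample rows or use a library routine.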
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.
First rows
| | trans_date_trans_time | cc_num | merchant | category | amt | first | last | gender | street | city | state | zip | lat | long | city_pop | job | dob | trans_num | unix_time | merch_lat | merch_long | is_fraud |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2019-01-01 00:00:18 | 2703186189652095 | fraud_Rippin, Kub and Mann | misc_net | 4.97 | Jennifer | Banks | F | 561 Perry Cove | Moravian Falls | NC | 28654 | 36.0788 | -81.1781 | 3495 | Psychologist, counselling | 1988-03-09 | 0b242abb623afc578575680df30655b9 | 1325376018 | 36.011293 | -82.048315 | 0 |
| 1 | 2019-01-01 00:00:44 | 630423337322 | fraud_Heller, Gutmann and Zieme | grocery_pos | 107.23 | Stephanie | Gill | F | 43039 Riley Greens Suite 393 | Orient | WA | 99160 | 48.8878 | -118.2105 | 149 | Special educational needs teacher | 1978-06-21 | 1f76529f8574734946361c461b024d99 | 1325376044 | 49.159047 | -118.186462 | 0 |
| 2 | 2019-01-01 00:00:51 | 38859492057661 | fraud_Lind-Buckridge | entertainment | 220.11 | Edward | Sanchez | M | 594 White Dale Suite 530 | Malad City | ID | 83252 | 42.1808 | -112.2620 | 4154 | Nature conservation officer | 1962-01-19 | a1a22d70485983eac12b5b88dad1cf95 | 1325376051 | 43.150704 | -112.154481 | 0 |
| 3 | 2019-01-01 00:01:16 | 3534093764340240 | fraud_Kutch, Hermiston and Farrell | gas_transport | 45.00 | Jeremy | White | M | 9443 Cynthia Court Apt. 038 | Boulder | MT | 59632 | 46.2306 | -112.1138 | 1939 | Patent attorney | 1967-01-12 | 6b849c168bdad6f867558c3793159a81 | 1325376076 | 47.034331 | -112.561071 | 0 |
| 4 | 2019-01-01 00:03:06 | 375534208663984 | fraud_Keeling-Crist | misc_pos | 41.96 | Tyler | Garcia | M | 408 Bradley Rest | Doe Hill | VA | 24433 | 38.4207 | -79.4629 | 99 | Dance movement psychotherapist | 1986-03-28 | a41d7549acf90789359a9aa5346dcb46 | 1325376186 | 38.674999 | -78.632459 | 0 |
| 5 | 2019-01-01 00:04:08 | 4767265376804500 | fraud_Stroman, Hudson and Erdman | gas_transport | 94.63 | Jennifer | Conner | F | 4655 David Island | Dublin | PA | 18917 | 40.3750 | -75.2045 | 2158 | Transport planner | 1961-06-19 | 189a841a0a8ba03058526bcfe566aab5 | 1325376248 | 40.653382 | -76.152667 | 0 |
| 6 | 2019-01-01 00:04:42 | 30074693890476 | fraud_Rowe-Vandervort | grocery_net | 44.54 | Kelsey | Richards | F | 889 Sarah Station Suite 624 | Holcomb | KS | 67851 | 37.9931 | -100.9893 | 2691 | Arboriculturist | 1993-08-16 | 83ec1cc84142af6e2acf10c44949e720 | 1325376282 | 37.162705 | -100.153370 | 0 |
| 7 | 2019-01-01 00:05:08 | 6011360759745864 | fraud_Corwin-Collins | gas_transport | 71.65 | Steven | Williams | M | 231 Flores Pass Suite 720 | Edinburg | VA | 22824 | 38.8432 | -78.6003 | 6018 | Designer, multimedia | 1947-08-21 | 6d294ed2cc447d2c71c7171a3d54967c | 1325376308 | 38.948089 | -78.540296 | 0 |
| 8 | 2019-01-01 00:05:18 | 4922710831011201 | fraud_Herzog Ltd | misc_pos | 4.27 | Heather | Chase | F | 6888 Hicks Stream Suite 954 | Manor | PA | 15665 | 40.3359 | -79.6607 | 1472 | Public affairs consultant | 1941-03-07 | fc28024ce480f8ef21a32d64c93a29f5 | 1325376318 | 40.351813 | -79.958146 | 0 |
| 9 | 2019-01-01 00:06:01 | 2720830304681674 | fraud_Schoen, Kuphal and Nitzsche | grocery_pos | 198.39 | Melissa | Aguilar | F | 21326 Taylor Squares Suite 708 | Clarksville | TN | 37040 | 36.5220 | -87.3490 | 151785 | Pathologist | 1974-03-28 | 3b9014ea8fb80bd65de0b1463b00b00e | 1325376361 | 37.179198 | -87.485381 | 0 |
Last rows
| | trans_date_trans_time | cc_num | merchant | category | amt | first | last | gender | street | city | state | zip | lat | long | city_pop | job | dob | trans_num | unix_time | merch_lat | merch_long | is_fraud |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1296665 | 2020-06-21 12:08:42 | 213193596103206 | fraud_Gulgowski LLC | home | 72.17 | James | Hunt | M | 7369 Gabriel Tunnel | Pointe Aux Pins | MI | 49775 | 45.7549 | -84.4470 | 95 | Electrical engineer | 1994-02-09 | 108c103b26f686c24c021aaf4210977e | 1371816522 | 44.938461 | -83.996234 | 0 |
| 1296666 | 2020-06-21 12:09:22 | 4587657402165341815 | fraud_Hyatt, Russel and Gleichner | health_fitness | 7.30 | Amber | Lewis | F | 6296 John Keys Suite 858 | Pembroke Township | IL | 60958 | 41.0646 | -87.5917 | 2135 | Psychotherapist, child | 2004-05-08 | 37a18c6fb0c5c722b6339ffedc82f55a | 1371816562 | 40.556811 | -88.092339 | 0 |
| 1296667 | 2020-06-21 12:10:56 | 4822367783500458 | fraud_Hahn, Douglas and Schowalter | travel | 19.71 | Christopher | Farrell | M | 97070 Anderson Land | Haines City | FL | 33844 | 28.0758 | -81.5929 | 33804 | Exercise physiologist | 1991-01-01 | 34e72e0a659a6c8f4a20ee65594f3a7d | 1371816656 | 27.465871 | -81.511804 | 0 |
| 1296668 | 2020-06-21 12:11:23 | 213141712584544 | fraud_Metz, Russel and Metz | kids_pets | 100.85 | Margaret | Curtis | F | 742 Oneill Shore | Florence | MS | 39073 | 32.1530 | -90.1217 | 19685 | Fine artist | 1984-12-24 | 0d86d8c17638d7eff77db9c6a878b477 | 1371816683 | 31.377697 | -90.528450 | 0 |
| 1296669 | 2020-06-21 12:11:36 | 4400011257587661852 | fraud_Stiedemann Inc | misc_pos | 37.38 | Marissa | Powell | F | 474 Allen Haven | North Loup | NE | 68859 | 41.4972 | -98.7858 | 509 | Nurse, children's | 1980-09-15 | 9a7ea2625cf8303efe34e3c09546868f | 1371816696 | 41.728638 | -99.039660 | 0 |
| 1296670 | 2020-06-21 12:12:08 | 30263540414123 | fraud_Reichel Inc | entertainment | 15.56 | Erik | Patterson | M | 162 Jessica Row Apt. 072 | Hatch | UT | 84735 | 37.7175 | -112.4777 | 258 | Geoscientist | 1961-11-24 | 440b587732da4dc1a6395aba5fb41669 | 1371816728 | 36.841266 | -111.690765 | 0 |
| 1296671 | 2020-06-21 12:12:19 | 6011149206456997 | fraud_Abernathy and Sons | food_dining | 51.70 | Jeffrey | White | M | 8617 Holmes Terrace Suite 651 | Tuscarora | MD | 21790 | 39.2667 | -77.5101 | 100 | Production assistant, television | 1979-12-11 | 278000d2e0d2277d1de2f890067dcc0a | 1371816739 | 38.906881 | -78.246528 | 0 |
| 1296672 | 2020-06-21 12:12:32 | 3514865930894695 | fraud_Stiedemann Ltd | food_dining | 105.93 | Christopher | Castaneda | M | 1632 Cohen Drive Suite 639 | High Rolls Mountain Park | NM | 88325 | 32.9396 | -105.8189 | 899 | Naval architect | 1967-08-30 | 483f52fe67fabef353d552c1e662974c | 1371816752 | 33.619513 | -105.130529 | 0 |
| 1296673 | 2020-06-21 12:13:36 | 2720012583106919 | fraud_Reinger, Weissnat and Strosin | food_dining | 74.90 | Joseph | Murray | M | 42933 Ryan Underpass | Manderson | SD | 57756 | 43.3526 | -102.5411 | 1126 | Volunteer coordinator | 1980-08-18 | d667cdcbadaaed3da3f4020e83591c83 | 1371816816 | 42.788940 | -103.241160 | 0 |
| 1296674 | 2020-06-21 12:13:37 | 4292902571056973207 | fraud_Langosh, Wintheiser and Hyatt | food_dining | 4.30 | Jeffrey | Smith | M | 135 Joseph Mountains | Sula | MT | 59871 | 45.8433 | -113.8748 | 218 | Therapist, horticultural | 1995-08-16 | 8f7c8e4ab7f25875d753b422917c98c9 | 1371816817 | 46.565983 | -114.186110 | 0 |